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[57] ABSTRACT 

A method of text entry into a device by activating keys of a 
keypad (16), where a key represents various characters. 
Input is delivered to a processor (30) corresponding to the 
keys activated. A dictionary (33) is searched for candidate 
combinations of characters corresponding to the keys acti- 
vated. The candidate combinations are rank ordered, for 
example using language model data and feedback (14, 38) is 
provided to a user indicating at least a highest rank ordered 
candidate combination. 

19 Claims, 6 Drawing Sheets 
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METHOD AND DEVICE FOR INPUT OF non-alpha keyboard, the user has to enter a specific combi- 

TEXT MESSAGES FROM A KEYPAD nation of numbers and/or symbols to indicate the word call. 

HELD OF THF I>rVFNTTnN "^^^ of example for a user to enter the word call the user 

tit,W Ub IHb INVENTION ^ust enter the following: 2***; 2*; 5***; 5***; #. Such a 

This invention relates generally to a data entry method ^ method of inputting is very restrictive, time consuming, and 

and apparatus, and more particularly to a method and editing is difScult and slow. 

apparatus for inputting text messages from a keypad, for Therefore a need exists to provide a user with a simple 

example a numeric keypad. straightforward method and apparatus for inputting alpha- 

nArvrnniiKm nv toc iKn/cvrrrr^KT numeric text through a non-alpha, numeric, or other keypad 

BACKGROUND OF THE INVENTION lO having fewer keys than the desired alphabet; and the need 

With the increasing desire to carry smaller and smaller exists to provide a system that is not restrictive to direct 

personal communicating devices with optimum functional- correlation or a specialized domain, i.e. where a variety of 

ity such as pagers, cellular phones, and other personal choices are available and easily input, 

communicator devices, and the desire to automate homes Various preferred embodiments of the invention are now 

through combinations of telecommunications and cable described, by way of example only, with reference to the 

where smart rooms will have one or more small devices that accompanying drawings, 
operate a variety of appliances or control a variety of 

applications; the need and desire to enter alphanumeric text BRIEF DESCRIPTION OF THE DRAWINGS 

through non-alpha or numeric keypad is ever increasine. vin i • a„ * • r -n 

Tr, 7h^ TTn t.^ Q» t f A ' ^ . i^^ug. 20 FIG. 1 IS a frout view of an illustrative text entry device 

In the Umted States of America it is standard for every suitable to receive input in accordance with the present 

telephone keypad to have letters as weU as numerals dis- invention. y^^^^i 

played on the keys. Thus, the key representing the numeral cm i • ui t j- ^ -,. . 

"2" has the letters A, B and C displayed ^and the key .^.I^V ^ 

representing the numeral "3" has the letters D, E and F etc. „ 

It is common to represent a telephone number by the letters ^}^' ^ ^ ^^^^^ diagram illustrating operation of the 

which correspond to the digits on the keypad. Thus, for ^^vice of FIG. 1 in accordance with a first embodiment of 

example, a raQway company with the name "Amtrack^' can ^® invention. 

acquire the telephone number 268 7225 and can advertise to F^G* 4 is a flow diagram illustrating operation of the 

customers that the company's telephone number can be 3^ device of FIG. 1 in a principal entry mode of FIG, 3. 

dialed by activating the keys with the letters A-M-T-R-A- FIG. 5 is a flow diagram illustrating further details of the 

C-IC Dialing these letters will result in only one digital operation of FIG. 4. 

^nTt^; ^""^ "^^'^^ ""'^''""'^ """^ represented by FIG. 6 is a state diagram Qlustrating operation of the 

many letter sequences. device of FIG. 1 in acco^lance with an alternative embodi- 

In the telecommunications field, methods of name selec- 35 ment of the invention. 

tion are known which use a numeric keypad. For examole T7in 1 u ^« • • • 

it .-c t.««,™ *- • M . . ^Aampic, i^hj 7 is an example of nme pmyin entnes with corre- 

treZ ?f , h'IT^ ' '""^^ "^^''L' '^"'^ "^^^ ^^'^^ embodiment of the invention, 

stream is dehvered to the voice mad system. Hie voice mail 40 » • . ^• 

system compares the input digit stream with a limited set of * ^ ^ ''^^^ "^'^^ram alternative to that of FIG. 6. 

options in a look-up table. Sufficient digits must be entered DETAILED DESCRIFnON OF PREFERRED 

to umqucly identify the dcsued user. Similarly, automated EMBODIMEN-re 
customer service systems can use this form of text entry. For 

example an airline information system can request the first 45 Generally, a method is now described comprising the 

three letters of a destination airport and, notwithstanding ^^^P^ comparing input with dictionary and language 

ambiguities in the letters to be represented by the digits model data stored in memory to generate at least one entry 

delivered, it can uniquely identify an airport from only three selected by probability of corresponding to the input; and 

digits because it has a limited ^'dictionary" of airport names providing user feedback of the entry having a likelihood of 

from which to seek a match. The same approach can be used 50 corresponding to the input. 

in a telephone speed-dial look up arrangement. A name can Particularly, a method of text entry into a device is 
be selected from a limited hst of names stored in a telephone provided comprising the steps of: activating keys of a 
terminal and a number can be dialed associated with the keypad, where a key represents a plurality of characters; 
name selected. delivering input to a processor corresponding to the keys 
Various other approaches for entering alphanumeric input 55 activated; searching a dictionary for candidate combinations 
through a non-alpha, numeric only, or other type of keypad °^ characters corresponding to the keys activated; rank 
having fewer keys or buttons than the conesponding alpha- ordering the candidate combinations; and providing feed- 
bet have been developed or are known. Many of these ^'ack to a user indicating at least a highest rank ordered 
methods take advantage of the letters displayed on the candidate combination. Here, "character" refers generally to 
standard US telephone keypad. However, often the methods 60 ^^^t^rs or to pictographic or grapheme characters, 
associated with the alphanumeric text through the keypad It may be noted that the dictionary is searchable by 
are cumbersome and difficult to edit. In addition, typically combinations of characters, where the characters by which 
the entry must be restricted or entered in a specific manner it is searchable are the same as the characters represented on 
such that there is only one option per sequence of input, the keys. 

More specifically, there is a direct correlation for a specific 65 A method and apparatus are also described for input of 

mput to a specific letter of the alphabet or a symbol. For text through a keypad having fewer keys than the number in 

example, if a user wants to input the word "cair through a the desired alphabet alone or in combination with numbers 



04/15/2004, EAST Version: 1.4.1 



5,952,942 

3 4 

and other symbols ReoogniUon of the input is preferably entry device, English words and letter tiigrams. The lan- 
correiated with the most likely text selection from a guage model data 34 includes unigram weight values for the 
dictionary, language, or compilation of data having an words and letter trigrams stored in the dictionary. OplionaUy 
unrestricted set of characters. the data 34 also includes word bigram and even word 
Language model data is preferably used to rank order the 5 trigram data. Other language model information can be 
candidate combinations in order of probability, Le, likeli- stored with unigram weight values 34. An optional loud- 
hood of usage in a given language. Such language model speaker 38 is shown coupled to the microprocessor, 
data preferably comprises character combination data. The Referring to FIG. 3, the microprocessor 18 is shown as 
character combination data comprises, for example, word ^^^ving three principal modes of operation for data entry (it 
unigram data, letter trigram data, or word bigram or trigram lO ^^^^ other modes and functions not relevant to 
(or n-gram) data or, for example, in a Chinese or other present invention). The first principal mode of data entry 
pictographic or grapheme language version to be described, ^ standard entry mode 40 and associated with this mode is 
it may comprise pinyin letter trigram data, pictographic or ^^^^^ modes of data entry are numeric 
grapheme character unigram data, pictographic or grapheme ^ supervised entry mode 48. Standard entry 
character bigram (or n-gram) data or word (i.e. pictographic is is now described in detail, with the other modes 
or grapheme character combination) bigram (or n-gram) ^^^"^ described later. 

data and the like. Referring now to FIG. 4, a flow diagram of a preferred 

In a preferred embodiment, at least one likely entry can be ^i^^^ent of recogniziDg alphanumeric input in standard 

displayed in a display area of a device and the at least one ^"^T °°°-^^P^^ °' numeric keypad is illustrated, 

likely entry presented in the display area can be edited A example will be considered where the digit sequence 2 

through input of a keypad 2 5 5 is entered. As illustrated, the first input digit "2" is 

«tw ..,h„ ^-„:„ u -I- .7 ' ^ 25 stepl40thenumencsequenceenteredsofarissentontothe 

™il,rH^.»,T r, ^ .^.^^l'^'^ l^l^bouc pager „ext step for comparison against ±e contents of the dictio- 

or personal digital assistant or telecommunications device) „ary 33. Thus, each entry is appended by micioprocessor 30 

comprises a bousing 12. a d^lay l/and Z^Tlt I^-^thTolS fX rSo^^'^ '"^^^^^ 

through which input is received. The display 14 has a text -n, « . j •. "'^""""y 

display atea 15 a^d an area 17 for disphyiig word, letter ? «;^„«>frespond to A. B or C. The 

combination or character altemativerriie kfypad U has ^"^.P""^"". f f'.^P "0 will search for words and 

twelve keys with digits ft-9 displayed thereon in a standard ^r^^fl^ ^^ f ''!^"!?'?? ^V^^ '"""^A 

layout. Also displayed in a standard layout are letteis of the n .'''A """f "^"^ 

Roman alphabet A-Z. THesc letters can'be /rrangcSTn otter T', J?\ °^ ' f^J^ 

arrangements, for example for greater ergonomic efficiency. I'T i^S^^'^^^o ^ """"^ ""J""""- 

but the iUustrated layout is preferred on account of estab- V r ^ f '^'^"''^ "^'^ ''^P 

lished use, THe key bearing the digit "1" has the pun«Sn '"'''.m '""f 1°' letter tngrams). In step 160 all 

marks "?•• and — displayed thereon. THe bw« S PO^^le matches co^elatmg to the input are identified and 

hand key 18 has the symbols "<<•• (meaning "backspac^' o *° ^"^l t ,T u "'^P'°«^• '° ^'^P "0. 

"scroll lefr as is explained below) and ' (referred to as T^f'^ ™f r '"^T-,^/" ^'='»^t«^''!."''' 

"star"). The lower ri^t hand key 19 has the symbols "»" ^"^Tv, u k ""^ '\^'?P '^^ 

(meaning "enter" or "scroll right" as is explained below) and °J T ^ , ''■S''"' /"'"'bjlity are displayed in 

"#" (referred to as "pound") ''•^P'*" '° =^ ^'"^ °°ly ^'S^^ so far 

. . 45 been entered, there is sufficient space in display area 17 to 

,lJh,hlH/iriu ^^'r^' IUustrated, the English display aU three active matches A, B and Cln step 200 the 

alphabet is used AltemaUvely. various other alphabets may program returns to step 120 and awaits the next digit input 

be used m accordance with the preferred embodiments of the when a second input "2" is received, the comparison 150 

present invention, such as Chinese phonetic alphabets or ^ill now yield word^ and trigrams begLintog 
Russian or Asian a phabets. In the pieferr«l embodiment of 50 AC, BA, BB. BC, CA, CB. and CC. If a tfi^ npuTs^ 

JeliLTak^rnT T"^ ^T' 'eceived.the^om^aris^nlSOcouldpoteotiauTyeTdenU^e^ 

device is a keypad having fewer individual keys than the beeinnine with AAJ AAK AAr ABT auk art n at 

number of letters in the alphabet, n^refore. the alphabet is S BAL BWMK^L^' ^^r a^' 

separatedintovariouss^enceswhereasinglenumeraland ^^^^CB^ CBtfiJ^^^a^Ccl^u^s^mJS 

Tbrck ^ftr/r.h°\'" ^^"^^ •^'^-^ ^pp«=- - ^« English i^^grani 

L^^o^i I H 1 ^ ^"u V?" ^1 "'"i'^d by step 150. If a fourth in%. 5, is received, 

Pressmg o a key delivers a digit but the digit can represent wotds beginning with AAJJ. AAJK. AAJL. A^O AAIOC 

one of a pluralUy of letters or characters. ^AKL. AAU, AALK. AAiIl. aSj. ABjOS,^ 

Reterrmg to FIG. 2. the cellular telephone 10 is illustrated ABKK, ABKL, ABU, ABLK, ABLL. BAJJ. BAJK, BAIL 
as having a microprocessor 30 coupled to the input pad 16 60 BAKJ. BAKK, BAKL, BAU, BALK, BALL, BBJJ BBJK,' 

and to the display 14 using standard input and output drivers BBJL. BBKJ, BBKK, BBKL, BBU. BBLK, BBLL BCJJ 

as are known in the art. Also coupled to the microprocessor BCJK. BOL. BCKJ, BCKK. BCKL. BCU, BCLK, BCLl' 

30 are a first memory 31 which is preferably electrically- CAJJ. CAJK. CAIL, CAKJ, CAKK, CAKL, CAU CALK,' 

erasable read-only memory (EEPROM) and a second CALL, CBJJ. CBJK. CBJL. CBKJ, CBKK, CBKL. CBU, 
memory 32 which is preferably random access memory 65 CBLK, CELL. COJ. COK, CCJL CCKJ CCKK,'cCKl' 

(RAM). In the EEPROM memory is stored a dicUonary 33. and CCU. CCLK, and CCLL are poWbiliti'es to be searched 

The dictionary 33 includes, for an English language data in the dictionary (less any combinations ruled out in the 
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previous round of comparison). From this list of aU with a large number of alternatives is ergonomicaUy ineffi- 

combmations, the dicuonary is compared and any matches cient. For these reasons, a language model is used in step 

are identified m step 150. In the example presented, the 170 to assign a probability to each of the matched entries 

entnes with matches found in the dictionary are BALK, identifying the likelihood that the entry corresponds to the 
BALL, CALK, and CALL 5 intended input. Preferably, step 170 causes the entry with the 

Alphanumeric recognition data can be stored in the highest probability of correlating to the alphanumeric input 

memory 31 . to be displayed in area 17 of the display 14 of the device 10. 

Id the preferred embodiment, the dictionary 33 represents Thi® ^ achieved by looking up and comparing uoigram 

a plurality of words or symbols and is preferably stored as weighting values from language model data 34 (and/or other 

a trie, as is described in "Fundamentals of Data Structures in language model data) for the different candidate alternatives 

Pascal" by E. Horowitz and S. Sahni, published by Com- and selecting the most likely candidates (e.g. four candidates 

puter Science Press, pages 512-520, The input is compared other nxunber up to, preferably, a maximum of 10 

against the dictionary and preferably as each input is candidates). 

received the comparison is continually updated. In a pre- An advantage of the present invention is that it allows for 
ferred embodiment, a comparison that will not yield a match ambiguous input and converts that input to a word or 
is eliminated and that combination and any and all further alphanumeric value that has the highest probability or like- 
combinations that include that combination are also then libood of correlating to the input. 

ignored or eUminated. For example purposes only, if a first Referring now to FIG. 5, details of the subroutine 130 of 

and a second input are 2 2, and the comparison with the FIG. 4 are shown 

t^^T2 f^^^,^^^ "^"^T"? ^^'^ °f one or more digits, and display of 

entnes that would match BB all further mput that would the candidate matches on display area 17, key 19 is pre^d 

begin with BB combmauons ^ elimmated or ignored. For bearing the markings the prograi^^oLe^s oTep 

urti^L^^^^^ commencement of an endtf' 

e wLte^^^^^^^^^^^ ' ?f V^h"'"- °' 25 ^^^'^ terminating the entry of the current word and 

ehminated and not compared to the dictionary. 25 ^^^^^ ^^^^^^^^ ^ ^^^^^^ ^^^^ ^^^^^^^ 

In accordance with a preferred embodiment of the present a space. 

™H^n nf ^- ^""l rf.^ """i T accelerated the At this point in the process, the highest priority match is 

AdvLZ. nf th t f I ^'"""^ "^""T"^ u"^""* ^^P^^y^ ^ alternatives display area 17 of display 14. TTie 
^^rS"^ 30 P^^gtam awaits new date entry at step 190. If the word or 

for compar^n and provides results wiUi a greater accuracy character displayed in display area 17 (the highest ranked 

of correlating to the input. Breadth-first searches are ^atch) is not the desired word or character-e.g. if CALL 

^^^rl2ir^:~^^^^^^ ^''^rtl'u' ^ ^^^P^^y^^' BALL is desired, pressing (briefl^l^d 

Computer Problem SoWmg by Judea Pearl, pubhshed by releasing key 19 (#») causes step 193 to cycle to the next 

^nfrXl^:: !^ 35 ^^^^^^^ With each prei of key 19!steps m and 193 c^^ 

^n^U^^^^ . L^'*°'''x,^; the next rankordered choice to be displayed in alternatives 

published by M.LT Press and McGraw-Hill at page 469 et display area 17 and in text display area 15 (eventually 

^' J u . returning to the original highest rank-ordered choice). If, in 

in a preferred embodunent of the present invention, the step 190, the input key 18 («*) is pressed, step 198 causes 

dictionaty whose contents are stored as a trie is searched the entire sequence of digits since the last space (the entire 

using a beam search. Various techniques for beam searches word or character) to be deleted and the program recom- 

are known in the art. For present purposes, a beam search is mences at step 120, where a new input can be received and 

a breadth-first search with limited breadth, where only, say the word can be re-entered. If the entry at step 190 is neither 

100 possible matches are considered at any moment This of keys 18 and 19 then logicaUy it must be another of the 

feature is particulariy advantageous because spelling errors keys of keypad 16, representing a digit 0-9 and step 197 is 

are permitted. Preferably, a restriction is established limiting reached. This entry is interpreted to be the start of a new 

the number of alternatives that wiU be searched. word or character. In step 197 the end-of-word mode ends 

More preferably, a restriction is established which and a new word (or character) begins. The program proceeds 

restricts or eliminates all alternatives from the dictionary to step 140 for processing of the new digit stream repre- 

that are returned which fall below a predetermined "score." 50 senting the new word or character. 

The score that is selected will vary depending upon the Other steps can be introduced. For example press-and- 

parameters that are determined to be the most significant for hold activations of various keys during the execution of 

the application for which the invention is being used. In a steps 182-198 can be given additional meanings and func- 

preferred embodiment, an alternative receives a high score tions. 

if the entry corresponds to an entry in the dictionary. If, 55 if, foUowing step 120, key 18 is pressed, step 186 causes 

however, the alteraaUve does not correspond to an entry in the program to proceed to step 188 and the last digit entered 

the dictionary, the alternative is given a low score. is deleted. The program then continues with step 140 and the 

More preferably, eadi alternative that does not match an numeric sequence so far entered (omitting the last digit) is 

entry in the dictionary is eliminated. Preferably, for each delivered to comparison step 150 for processing as before, 

alternative that does match an entry in the dictionary, the eo It will, of course, be appreciated by one of ordinary skill 

alternative is compared to a unigram language model to in the art of computer programming that the steps illustrated 

determine the frequency of occurrence of that alternative. in FIGS. 4 and 5 can be implemented in many different 

In the preferred embodiment, all input that matches or forms and do not have to be implemented in the sequences 

corresponds to an entry in the dictionary at step 150 of FIG. shown. 

4 is returned to the memory 32. In many instances, there are 6S If at any time in mode 40 (except between steps 184 and 

too many possible matches for all matches to be displayed 195) the desired word to be entered in the text display area 

m akernatives display area 17. Moreover, presenting a user 15 is not the highest priority word, key 19 can be pressed 
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dowQ and held down for a predetermined time (e.g. 1.5 
seconds) and this will cause cycle mode 44 (FIG. 3) to 
operate and the candidate matches in display area 17 will 
cycle through (cycling to the left if more than one is 
displayed). Upon release of the respective button, standard 5 
entry mode 40 resumes. Pressing- and-holding of key 19 
between steps 184 and 195 can be given a different meaning 
(because pressing and releasing that key at that time already 
operates the function of cycling through the choices). It is 
preferred that here pressing-and-holding key 19 has the 10 
effect of entering a space in text display area 15. 

When the desired word is displayed in the left-most 
position in the display area 17, it can be selected by pressing 
key 19 briefly (less than the predetermined time) and releas- 
ing it. This action delivers the word to memory 32 and to text is 
display area 15 as described above. 

If the display area 17 is small in comparison to the length 
of the candidate matches, there may be only one candidate 
displayed at one time and key 19 will cause the next 
candidate (in rank order of likelihood of selection) to be 20 
displayed. 

The standard mode of text entry 40 described and illus- 
trated in FIGS. 4 and 5 allows for very fast text entry firom 
multiplexed keys where each key represents a plurdity of 
letters or characters. For example, 65% of English text is 25 
written using only 100 words. 91% of these words can be 
entered without having to use keys 18 and 19 to scroll 
beyond the first presented match. Thus the arrangement 
described comes close to meeting a goal of one keystroke 
entry per letter. 30 

Where the words to be entered are all included in the 
dictionary standard entry mode 40 (and associated modes 42 
and 44) can be used continuously to enter text. 

In alternative embodiment of standard entry mode 40, a 
bigram language model (two-word combination model) is ^" 
used to determine the frequency of two consecutive words 
occurring together and to rank order the candidate matches 
accordingly. This embodiment, however, can require a large 
memory 31, especially when operating in the English 
language, on account of the very large number of possible ^ 
bigrams. Atrigram language model can be used to determine 
the frequency of three words occurring together. 

In accordance with preferred features of the present 
invention, alternative data entry modes are provided. 

Numeric entry mode 46 can be entered by pressing and 
holding a key oonresponding to the digits 0-9, Upon pressing 
and holding one of these keys, the different multiplexed 
characters (digit and letters) are displayed in cyclic order in 
the display area 17. For example pressing and holding the 
"2" key causes the characters A-B-C-2-A-B-C-2 to be 
displayed in sequence (or 2-A-B-C etc.). If the key is 
released while a nimieral is displayed, numeric entry mode 
46 is activated. If it is released while a letter is displayed, 
supervised entry mode 48 is activated. 

Once numeric entry mode 46 is activated, every subse- 
quent key press is assumed to be a number, until key 19 is 
pressed. Thus the number 8211223 shown in FIG. 1 can 
easily be entered by one long press of key "8" followed by 
a short press of each of the subsequent keys. ^ 

Upon pressing key 19 the program returns to standard 
data entry mode 40. 

When supervised data entry mode 48 is entered, each 
subsequent letter can also be entered by a similar press-and- 
scroU method of letter selection (as is known in the art). 65 

As a preferred alternative, supervised data entry mode 48 
operates in the same way as standard entry mode 40 but uses 
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only letter irigrams to select the next candidate character. 
When key 19 is pressed, supervised data entry mode ends, 
the di^layed (or in the preferred alternative the highest 
priority displayed) word is stored in memory 32 and dis- 
played in text display area 15 and the program returns to 
standard entry mode 40. Supervised entry mode is particu- 
lariy useful for entry of words not included in the dictionary 
33 and for entry of names 

In the embodiment described above, a predetermined key 
(e.g. "^') is preferably to select and enter the highest rank 
ordered candidate combination. 

A memory (e.g. EEPROM 31 or a separate computer disk) 
can be provided having instructions and data stored therein 
that, when executed, cause the microprocessor 30 to com- 
prise: an input; a dictionary 33; language model data 34; a 
search algorithm 150, to search the dictionary for candidate 
combinations of characters corresponding to digits received 
at the input (input from input pad 16); a rank ordering 
function 170 that rank orders the candidate combinations 
according to the language model data; and an output (to 
display 14 or loud^eaker 38) to provide feedback indicating 
at least a highest rank ordered candidate combination. 

There now follows a description of how the above 
embodiment of a data entry device can be modified for very 
simple and fast entry of Chinese characters. In this embodi- 
ment it is preferred (but not essential) that one of a plurality 
of keys (e.g. 0-9) is activated to select and enter one of a 
plurality of rank ordered candidate combinations. 

Since the mid 1950s, a standard phonetic representation 
of Qiinese characters has been adopted known as "pinyin". 
The pinyin for each Chinese character, like each English 
word, is made up from combinations of the 26 letters of the 
Roman alphabet A-Z. 

So a pinyin based Chinese keypad data entry device can 
have the same keypad layout as that shown in FIG. 1. The 
mapping table of telephone digit keys and 26 English letters 
is as shown in FIG. 1. (Alternatively the digits 7 through 0 
can represent <p r s>, <t u v>, <w x y>, <q z> respectively 
or digits 7 through 9 and digit 1 can represent <p r s>, <t u 
v>, <w X y>, <q z> respectively.) 

Referriog to FIG. 6, a mode diagram for a program 
operated by microprocessor 30 is shown for this embodi- 
ment of the invention. There is a standard entry mode 300 
with an associated character selection mode 302. Character 
selection mode has scroll left and scroll right modes 304 and 
306 as sub-modes. An alternative alphanumeric entry mode 
310 is provided 

In standard entry mode 300, a user enters the pinyin of 
each Chinese character he wants to input by using the 
telephone keys marked a,b,c,d, ... in the same manner as 
described above with respect to entry of English words. (For 
verification, a digit string is di^layed in a small window 13 
which is located at the top right of the display 14), He can 
press key 18 to delete a digit that is wrong. When finished, 
he presses key 19. The program then proceeds to character 
selection mode 302 and displays in character selection area 
17 of the display the top 10 Chinese characters that have the 
pinyin entered. The characters are rank ordered based on the 
language model data 34. A number from 0 to 9 is displayed 
under each Chinese character. The user can select the 
Chinese character he wants to input by pressing its corre- 
sponding number displayed. For example, if the Chinese 
character marked by 3 is the desired one, he simply presses 
the key marked by 3 nto put it in the text area 15 of the 
display 14 and store it in memory 32. The program then 
returns to standard entry mode 300. 
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If the user can Dot find the Chinese character he wants to standard mode 300, when pressing and holding key 0 for a 

input in the display area 17, he can press key 19 again, then predetermined time (e.g. 1-5 seconds), the program enters 

mode 304 of the program gives a list of another 10 charac- the Chinese punctuation and symbol entry mode 620 and 

ters and these are displayed in display area 17. The user can displays in character selection area 17 of the display digit 0, 
revert to the previous list of 10 characters by pressing key 18 5 space, punctuation marics, and other symbols 
to activate mode 304. Since the dictionary 33 contains every 

possible Chinese character, the user can always find the right q . j 7/ 

character be wants to input. Eg. : 

0123456789 

If the pinyin the user entered is not a legitimate pinyin of 
a Chinese character, the system displays nothing and gives 

a beep through the loudspeaker 38 or leaves the digit stream A number from 0 to 9 is displayed under each symbol (or 

displayed unchanged m wmdow 13 and gives a beep. item). The user can select the Chinese symbol he wants to 

Once the user has entered the pinyin followed by key input by pressing its corresponding number displayed. If the 

19, the two editor keys and become the selecting user cannot find the symbol he wants to input, he can press 

keys for the top 10 candidate Ust, one is forward, the other key 19, then mode 620 of the program gives a list of another 

IS backward The user must make his selection by pressing 10 Chinese symbols and these are displayed in the display 

one of 10 telephone digit keys even if he can not find the area 17. The user can revert to the previous list of 10 

Chmese character he wants to mput. This returns the opera- symbols by pressing key 18 to activate mode 620, There are 

tion to standard entry mode 40 and key 18 can be used to more than 600 Chinese symbols 

ttrlVH?°*Tt'^T''' '^"^^ "^"^ P^^^ 20 ^y«^^^ keys 1-9 to enter ASCII symbols, which 

for the desired character ^^^^^^ English alphanumerics, punctuation marte, and 

An example of nme pmyin entnes, with corresponding other symbols. In standard mode 300, when pressing/ 

digit streams 502 and examples of selected characters 504 holding key 1 for a predetermined time (e g 1-5 seconds) 

corresponding to those pinyin entries is shown in FIG. 7. the program enters mode 610 and displays digit 1, space! 

There are four frequently used punctuation marks punctuation marks, and other ASCII symbols. 
"?", " !" marked in key 1. The system uses the press/hold 

mode to enter one of them. When the user wants to input the ]^ .?!-$#© 

punctuation, he presses and holds key 1. The program enters ^ e- • 01 2^56789 
mode 310, Then the digit 1 and the four punctuation marks 
**,*', "?", "!" are then displayed on the screen one by one. 

The user can select the punctuation he wants to input by A number firom 0 to 9 is displayed under each symbol (or 

release key 1 when it is being displayed. item). The user can select the ASCII symbol he wants to 

In a manner similar to the input of punctuation, alphanu- input by pressing its corresponding number displayed. If the 

meric entry mode 310 is entered in a press/hold manner to user cannot find the symbol he wants to input, he can press 
enter digits and Roman letters. For example, if the user 35 key 19, then mode 610 of the program gives a list of another 

wants to enter the Roman character M, he presses and holds 10 ASCII symbols and these are di^layed in the display area 

key 6 which is marked by M, N, 0. The system then displays 17 The user can revert to the previous list of 10 symbols by 

digit 6, the small cases of these three characters and the pressing key 18 to activate mode 610. 

capital cases of these three characters one by one. The user Other digits and Roman letters can also be entered in a 
then releases the key when he sees the character M on the ^ press/hold manner. For example, if the user wants to enter 

screen. Roman character M, he presses and holds key 6 which is 

Referring to FIG. 8, a state diagram alternative to that of marked by M, N, O. The system then enters mode 610 from 

FIG. 6 is shown. In this diagram, standard entry mode 300 mode 300 and displays digit 6, the small and capital cases of 

as well as Chinese (or other pictographic or grapheme) English letters m, n, o, like this 
character selection mode 302 and scroll right and scroll left 45 

modes 304 and 306 are unaltered. Mode 310 of FIG. 6 is 6mno\fNOExit 
split into two alternative modes in FIG. 8, these being 

Chinese punctuation and symbol mode 620 and ASCII 01234567 
symbol entry mode 610. Mode 620 is reached from mode 

300 by pressing and holding key "0;\ Mode 610 is reached 50 -j^e user can then select the capital letter M by pressing key 

from mode 300 by pressmg and holding one of keys "1-9" a ^.u V ^ , ^ prea&mg jtey 

Tlcse modea 620 ^i,d 610 have respective s«o!l nW and !'l^L''HT,T,'^T%°1'r- h fr^'"'''^'"'*^^ 

scoU left modes 621. 622. 611 and 612 sunilar to modes 306 ""^'^/^f ' ^'°'u° 

and 304. AddiUonaUy insert return mode 601 is provided !l*i7lT'^n , ^"""^ '° "^T '° 

which is reached from mode 300 by pressing ke? 19 (#) M^fni"',,^" any Roman letters. 

without first entering any pinyin. Mode 302 to 8 ^} "^^T '"^TH^'^u '° ^ bet^^^ 

««i . u J u • 1 /^A I \ . . ' words or characters. Mode 632 is the same as mode 44 in 

Mtir^i'tnH^r?'!!^^ H^i? W,fft" ^-'^^i P^yi"- FIG. 1 and mode 631 is similar. 

Modes 631 and 632 are added (scroll right and scroll left ^ r ttt/^o i i j ^ o 

respectively) which can be reached from mode 300 by .J^V^^^r^ °^ T . ' « °' « be to 

pressing and holding respective keys 19 and 18 for a '^'"g Zh"y>" pho-^^tic Chinese symbols mstead of 

JredeteLned time (f .g. iTseconds) '° Z^i ^7^"' Roman) symbols, by smtab y displaying 

<-.A ■ , fi 1 u • ^ ^ ■ Zhuym phonetic Chmese symbols on 10 telephone 

5oS^l^^?!,nn^H^.hr'"';Jl? ^?r' *r ^'^"i keypadkeysandsultablyprovidingmatchingdictionaSrand 

500 punctuation and other symbols that do not form parts of language model data ^ ' 

words but stand atone. The^ can be selected by pressing it wiU be apparem to those skilled in the art that the 

C and scroUing left or right to select the desired symbol. 65 disclosed invention maybe modified in numerous ways and 

The system uses key 0 to enter Chinese punctuaUon marks may assume many embodiments other than the preferred 

and other Chinese symbok. such as space, slash, etc. In forms particularly set out and described above. For example. 
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instead of providing a display 14 and user feedback through 
area 17 of the display, synthesized voice can be output 
through the speaker 38 to provide user feedback of a 
candidate match. Additionally, the input pad 16 and loud- 
speaker 38 can be remote from the microprocessor 30 and its 5 
memories 31 and 32, providing remote data entry. This 
anangement is particularly useful for allowing a standard 
telephone handset to be used for text entry, for example for 
entering paging messages to a paging system from a remote 
location. Accordingly, it is intended by the appended claims 
to cover all modifications of the present invention that fall 
within the true spirit and scope of the present invention and 
its equivalents. 
What is claimed is: 

1. A method of text entry into a device comprising the 
steps of: 
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activating keys of a keypad, where a key represents a 
plurality of characters in a first alphabet; 

delivering input to a processor corresponding to a set of 
plural keys activated; 

searching a dictionary for a set of alternative characters in 
a second alphabet, each alternative character ambigu- 
ously corresponding to the set of plural keys activated, 25 
to thereby provide candidate second alphabet charac- 
ters; 

rank ordering the candidate second alphabet characters 
utilizing language model data indicative of likelihood 
of usage in a given language to order the candidate 
second alphabet characters in order of probability; and 

providing feedback to a user indicating at least a highest 
rank ordered candidate second alphabet character. 

2. The method of claim 1, further comprising the step of 35 
activating a predetermined key to select and enter the 
highest rank ordered candidate second alphabet character. 

3. The method of claim 1, further comprising the step of 
activating one of the keys of the keypad to select and enter 
one of a plurahty of rank ordered candidate second alphabet 40 
characters. 

4. The method of claim 1, wherein the language model 
data comprises unigram data. 

5. The method of claim 1, wherein the language model 
data comprises letter trigram data. 45 

6. A device comprising: 

a ke>rpad, said keypad having an input area of discrete 
keys, each key representing plural characters in a first 
alphabet, and a display area, said input area configured 
to generate numeric input upon activation of the keys; 

display processing circuitry for presenting a graphical 
image of a second alphabet ideographic character on 
the display; 

a memory having at least alphanumeric recognition data 55 
stored therein; and 

digital processing circuitry for comparing the numeric 
input with the alphanumeric recognition data and for 
rank ordering the candidate second alphabet characters 
utilizing probability data indicative of likelihood of 60 
usage in a given language to order the candidate second 
alphabet characters in order of probability to provide to 
the display processing circuitry candidate recognition 
information including at least one likely candidate 
second alphabet ideographic character having a highest 65 
probabihty of corresponding to the numeric input 
value. 



7. A method comprising the steps of: 
receiving input data from a keypad area of a device having 

a set of keys, each key representing a plurality of 
characters such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

comparing the input data with a dictionary and word 
bigram language model data to identify at least one 
likely entry likely to represent the input data; and 
displaying the at least one likely entry in a display area of 
said device. 

8. A method comprising the steps of: 
comparing input data received from a keypad area of a 

device with a dictionary, the keypad area comprising a 
set of keys, each key representing a plurality of char- 
acters such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

identifying candidate words from the dictionary corre- 
sponding to the input data; 
identifying, using word bigram language model data, at 
least one likely candidate word having a highest prob- 
ability of representing the input data; 
storing the at least one likely candidate word in memory; 
displaying the at least one likely candidate word in a 

display area of the device; and 
editing the at least one likely candidate word presented in 
the display area through mput of the keypad area. 

9. A method of processing keypad entry into alphanu- 
meric text, comprising the steps of: 

receiving input as a sequence of numeric values; 
converting the sequence of numeric values to at least one 

alphanumeric unit; 
comparing the at least one alphanumeric unit with a 
dictionary to identify candidate words likely to repre- 
sent the input; 

assigning through word bigram data a probability to each 

of the candidate words; and 
displaying a candidate word having a highest probability. 

10. A device comprising: 

a keypad, said keypad having an input area and a display 
area, said input area configured to receive numeric 
input; 

display processing circuitry for presenting a graphical 

image on the display; 
a memory having at least alphanumeric recognition data, 

including word bigram data stored therein; and 
digital processing circuitry for comparing the numeric 
input with the alphanumeric recognition data to provide 
to the display processing circuitry candidate recogni- 
tion information including at least one likely candidate 
word having a highest probability of corresponding to 
the numeric input value, as detennined by the word 
bigram data. 

11. A method of text entry into a device comprising the 
steps of; 

activating keys of a keypad, where a key represents a 
plurality of characters and activation of each key gen- 
erates an ambiguous input signal having a probability 
of representing any one of the pluraUty of characters for 
that key; 
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delivering input to a processor corresponding to the keys 

activated; ^f. 

searching a dictionary for words corresponding to the 

keys activated to provide candidate words; 
rank ordering the candidate words utilizing word bigram 

language model data to order the candidate words in 

order of probability; and 
providing feedback to a user indicating at least a highest 

rank ordered candidate word. 

12. The method of claim 11, further comprising the step 
of activating a predetermined key to select and enter the 
highest rank ordered candidate word. 

13. The method of claim 11, further comprising the step 
of activating one of the keys of the keypad to select and enter 
one of a plurality of rank ordered candidate words. 

14. A text entry device comprising: a keypad having a set 
of keys, each key representing a plurality of characters such 
that activation of a key generates an ambiguous input signal 
having a probability of representing any one of the plurality 
of characters for that key; a dictionary; word bigram lan- 
guage model data; an output device; and a processor coupled 
to the keypad, dictionary, language model data and output 
device, 

the processor being programmed to receive input corre- 25 
spending to keys activated, to search for candidate 
combinations of characters corresponding to the keys 
activated, to rank order the candidate combinations 
according to the word bigram language model data and 
to provide feedback through the output device indicat- 30 
ing at least a highest probability rank ordered candidate 
combination. 

15. A method of processing keypad entry into alphanu- 
meric text, comprising the steps of: 

receiving input as a sequence of numeric values where 35 
each numeric value has a probability of representing 
any one of a plurality of characters; 

converting the sequence of numeric values to at least one 
alphanumeric txnit; 

comparing the at least one alphanumeric unit with a ^ 
dictionary to identify candidate words likely to repre- 
sent the input; 

assigning through word bigram data a probability to each 
of the candidate words; and 

45 

displaying a candidate word having a highest probability 
data as determined by the word diagram data. 
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16. The method of claim 15, further comprising the step 



editing the candidate word having the highest probability. 
17. The method of claim 15, wherein the step of com- 
paring the at least one alphanumeric unit with a dictionary 
is completed through use of a breadth-first search. 
IS, A device comprising: 

a keypad, said keypad having an input area of discrete 
keys, each key representing a plurality of characters, 
and a display area, said input area configured to receive 
numeric input such that activation of a key generates an 
ambiguous input signal having a probability of repre- 
senting any one of the plurality of characters for that 
key; 

display processing circuitry for presenting a graphical 
image on the display; 

a memory having at least alphanumeric recognition data 
stored therein; and 

digital processing circuitry for comparing the numeric 
input with the alphanumeric recognidon data and for 
assigning word bigram probabilities to words repre- 
sented by the alphanumeric recognition data, to provide 
to the display processing circuitry candidate recogni- 
tion information including at least one likely candidate 
word representing having a highest probability of cor- 
responding to the numeric input value as determined by 
the word bigram probabilities. 
19. A device for text entry comprising: 
a memory having instructions and data stored therein that, 
when executed, cause a digital processor to comprise: 
an input; 
a dictionary 

at least word bigram language model data; 
a search algorithm, to search the dictionary for combina- 
tions of characters and words corresponding to digits 
received at the input to provide candidate combinations 
and words; 

a rank ordering function that rank orders the candidate 
combinations and words according to the word bigram 
language model data; and 

an output to provide feedback indicating at least a highest 
rank ordered candidate combination or word. 
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